On Continuous-Action Q-Learning via Tile Coding Function Approximation
نویسنده
چکیده
Reinforcement learning (RL) is a powerful machine-learning methodology that has an established theoretical foundation and has proven effective in a variety of small, simulated domains. There has been considerable work on applying RL, a method originally conceived for discrete state-action spaces, to problems with continuous states. The extension of RL to allow continuous actions, on the other hand, has seen relatively little research. One proposed approach to allowing continuous actions is to represent the value function using a tile-coding function approximator. We introduce a simulated domain for the controlled study of this method in conjunction with Q-learning and report empirical results on its performance under different parameterizations. Our experimental findings contribute a deeper understanding of the workings of tile coding in continuous-action domains, provide guidance to parameter choices, and point out an improvement on this method which we verify empirically.
منابع مشابه
Tile Coding Based on Hyperplane Tiles
In large and continuous state-action spaces reinforcement learning heavily relies on function approximation techniques. Tile coding is a well-known function approximator that has been successfully applied to many reinforcement learning tasks. In this paper we introduce the hyperplane tile coding, in which the usual tiles are replaced by parameterized hyperplanes that approximate the action-valu...
متن کاملFunction Approximation via Tile Coding: Automating Parameter Choice
Reinforcement learning (RL) is a powerful abstraction of sequential decision making that has an established theoretical foundation and has proven effective in a variety of small, simulated domains. The success of RL on realworld problems with large, often continuous state and action spaces hinges on effective function approximation. Of the many function approximation schemes proposed, tile codi...
متن کاملReinforcement Learning applied to Keepaway, a RoboCup-Soccer Subtask
This Bachelor Final Project aims to be a demonstration of the power and usefulness of reinforcement learning, especially for RoboCup-Soccer. In the first part general theories behind reinforcement learning are described. Different kinds of basic solving methods are compared; Value iteration, Policy iteration, Monte Carlo methods, Sarsa and Q-Learning. Eligibility traces are added to the basic m...
متن کاملA Q-learning Based Continuous Tuning of Fuzzy Wall Tracking
A simple easy to implement algorithm is proposed to address wall tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it solely based on locally sensed data. The proposed method benefits from coupling fuzzy logic and Q-learning to meet requirements of autonomous navigations. Fuzzy if-then rules provide a reliable decision maki...
متن کاملAdaptive Tile Coding for Value Function Approximation
Reinforcement learning problems are commonly tackled by estimating the optimal value function. In many real-world problems, learning this value function requires a function approximator, which maps states to values via a parameterized function. In practice, the success of function approximators depends on the ability of the human designer to select an appropriate representation for the value fu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004